OCR exploration server

Warning: this site is under development!
Warning: this site was generated automatically from raw corpora.
The information has therefore not been validated.

SmartDoc-QA: A Dataset for Quality Assessment of Smartphone Captured Document Images - Single and Multiple Distortions

Internal identifier: 000065 (Main/Exploration); previous: 000064; next: 000066

Authors: Nibal Nayef [France]; Muhammad Muzzamil Luqman [France]; Sophea Prum [France]; Sébastien Eskenazi [France]; Joseph Chazalon [France]; Jean-Marc Ogier [France]

RBID : Hal:hal-01319900

Abstract

Smartphones are enabling new ways of capture, hence arises the need for seamless and reliable acquisition and digitization of documents. The quality assessment step is an important part of both the acquisition and the digitization processes. Assessing document quality could aid users during the capture process or help improve image enhancement methods after a document has been captured. Current state-of-the-art works lack databases in the field of document image quality assessment. In order to provide a baseline benchmark for quality assessment methods for mobile captured documents, we present in this paper a dataset for quality assessment that contains both singly- and multiply-distorted document images. The proposed dataset could be used for benchmarking quality assessment methods by the objective measure of OCR accuracy, and could be also used to benchmark quality enhancement methods. There are three types of documents in the dataset: modern documents, old administrative letters and receipts. The document images of the dataset are captured under varying capture conditions (light, different types of blur and perspective angles). This causes geometric and photometric distortions that hinder the OCR process. The ground truth of the dataset images consists of the text transcriptions of the documents, the OCR results of the captured documents and the values of the different capture parameters used for each image. We also present how the dataset could be used for evaluation in the field of no-reference quality assessment. The dataset is freely and publicly available for use by the research community at http://navidomass.univ-lr.fr/SmartDoc-QA.
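
The abstract names OCR accuracy as the objective measure but does not say how it is computed. As a rough illustration only, the sketch below compares a ground-truth transcription with an OCR result using a character-level edit distance. The function names (edit_distance, char_accuracy) and the choice of metric are assumptions made for this example, not the measure defined by the dataset authors.

```python
def edit_distance(a, b):
    """Levenshtein distance between two strings (insert, delete, substitute)."""
    prev = list(range(len(b) + 1))  # distances from the empty prefix of a
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def char_accuracy(ground_truth, ocr_text):
    """Hypothetical accuracy score in [0, 1]: 1 - errors / len(ground_truth)."""
    if not ground_truth:
        return 1.0 if not ocr_text else 0.0
    errors = edit_distance(ground_truth, ocr_text)
    return max(0.0, 1.0 - errors / len(ground_truth))
```

A score like this could be computed per captured image and then compared against the output of a no-reference quality assessment method, which is one way the benchmarking described in the abstract might be set up.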

The document in XML format

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">SmartDoc-QA: A Dataset for Quality Assessment of Smartphone Captured Document Images - Single and Multiple Distortions</title>
<author>
<name sortKey="Nayef, Nibal" sort="Nayef, Nibal" uniqKey="Nayef N" first="Nibal" last="Nayef">Nibal Nayef</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-40831" status="VALID">
<orgName>Laboratoire Informatique, Image et Interaction</orgName>
<orgName type="acronym">L3I</orgName>
<desc>
<address>
<addrLine>Bâtiment Pascal Avenue Michel Crépeau F-17042 La Rochelle Cedex 1</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-lr.fr/l3i</ref>
</desc>
<listRelation>
<relation name="EA2118" active="#struct-300311" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle name="EA2118" active="#struct-300311" type="direct">
<org type="institution" xml:id="struct-300311" status="VALID">
<orgName>Université de La Rochelle</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
<placeName>
<settlement type="city">La Rochelle</settlement>
<region type="region" nuts="2">Poitou-Charentes</region>
</placeName>
<orgName type="university">Université de La Rochelle</orgName>
</affiliation>
</author>
<author>
<name sortKey="Luqman, Muhammad Muzzamil" sort="Luqman, Muhammad Muzzamil" uniqKey="Luqman M" first="Muhammad Muzzamil" last="Luqman">Muhammad Muzzamil Luqman</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-40831" status="VALID">
<orgName>Laboratoire Informatique, Image et Interaction</orgName>
<orgName type="acronym">L3I</orgName>
<desc>
<address>
<addrLine>Bâtiment Pascal Avenue Michel Crépeau F-17042 La Rochelle Cedex 1</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-lr.fr/l3i</ref>
</desc>
<listRelation>
<relation name="EA2118" active="#struct-300311" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle name="EA2118" active="#struct-300311" type="direct">
<org type="institution" xml:id="struct-300311" status="VALID">
<orgName>Université de La Rochelle</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
<placeName>
<settlement type="city">La Rochelle</settlement>
<region type="region" nuts="2">Poitou-Charentes</region>
</placeName>
<orgName type="university">Université de La Rochelle</orgName>
</affiliation>
</author>
<author>
<name sortKey="Prum, Sophea" sort="Prum, Sophea" uniqKey="Prum S" first="Sophea" last="Prum">Sophea Prum</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-40831" status="VALID">
<orgName>Laboratoire Informatique, Image et Interaction</orgName>
<orgName type="acronym">L3I</orgName>
<desc>
<address>
<addrLine>Bâtiment Pascal Avenue Michel Crépeau F-17042 La Rochelle Cedex 1</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-lr.fr/l3i</ref>
</desc>
<listRelation>
<relation name="EA2118" active="#struct-300311" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle name="EA2118" active="#struct-300311" type="direct">
<org type="institution" xml:id="struct-300311" status="VALID">
<orgName>Université de La Rochelle</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
<placeName>
<settlement type="city">La Rochelle</settlement>
<region type="region" nuts="2">Poitou-Charentes</region>
</placeName>
<orgName type="university">Université de La Rochelle</orgName>
</affiliation>
</author>
<author>
<name sortKey="Eskenazi, Sebastien" sort="Eskenazi, Sebastien" uniqKey="Eskenazi S" first="Sébastien" last="Eskenazi">Sébastien Eskenazi</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-40831" status="VALID">
<orgName>Laboratoire Informatique, Image et Interaction</orgName>
<orgName type="acronym">L3I</orgName>
<desc>
<address>
<addrLine>Bâtiment Pascal Avenue Michel Crépeau F-17042 La Rochelle Cedex 1</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-lr.fr/l3i</ref>
</desc>
<listRelation>
<relation name="EA2118" active="#struct-300311" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle name="EA2118" active="#struct-300311" type="direct">
<org type="institution" xml:id="struct-300311" status="VALID">
<orgName>Université de La Rochelle</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
<placeName>
<settlement type="city">La Rochelle</settlement>
<region type="region" nuts="2">Poitou-Charentes</region>
</placeName>
<orgName type="university">Université de La Rochelle</orgName>
</affiliation>
</author>
<author>
<name sortKey="Chazalon, Joseph" sort="Chazalon, Joseph" uniqKey="Chazalon J" first="Joseph" last="Chazalon">Joseph Chazalon</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-40831" status="VALID">
<orgName>Laboratoire Informatique, Image et Interaction</orgName>
<orgName type="acronym">L3I</orgName>
<desc>
<address>
<addrLine>Bâtiment Pascal Avenue Michel Crépeau F-17042 La Rochelle Cedex 1</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-lr.fr/l3i</ref>
</desc>
<listRelation>
<relation name="EA2118" active="#struct-300311" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle name="EA2118" active="#struct-300311" type="direct">
<org type="institution" xml:id="struct-300311" status="VALID">
<orgName>Université de La Rochelle</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
<placeName>
<settlement type="city">La Rochelle</settlement>
<region type="region" nuts="2">Poitou-Charentes</region>
</placeName>
<orgName type="university">Université de La Rochelle</orgName>
</affiliation>
</author>
<author>
<name sortKey="Ogier, Jean Marc" sort="Ogier, Jean Marc" uniqKey="Ogier J" first="Jean-Marc" last="Ogier">Jean-Marc Ogier</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-40831" status="VALID">
<orgName>Laboratoire Informatique, Image et Interaction</orgName>
<orgName type="acronym">L3I</orgName>
<desc>
<address>
<addrLine>Bâtiment Pascal Avenue Michel Crépeau F-17042 La Rochelle Cedex 1</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-lr.fr/l3i</ref>
</desc>
<listRelation>
<relation name="EA2118" active="#struct-300311" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle name="EA2118" active="#struct-300311" type="direct">
<org type="institution" xml:id="struct-300311" status="VALID">
<orgName>Université de La Rochelle</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
<placeName>
<settlement type="city">La Rochelle</settlement>
<region type="region" nuts="2">Poitou-Charentes</region>
</placeName>
<orgName type="university">Université de La Rochelle</orgName>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">HAL</idno>
<idno type="RBID">Hal:hal-01319900</idno>
<idno type="halId">hal-01319900</idno>
<idno type="halUri">https://hal.archives-ouvertes.fr/hal-01319900</idno>
<idno type="url">https://hal.archives-ouvertes.fr/hal-01319900</idno>
<date when="2014-08-24">2014-08-24</date>
<idno type="wicri:Area/Hal/Corpus">000109</idno>
<idno type="wicri:Area/Hal/Curation">000109</idno>
<idno type="wicri:Area/Hal/Checkpoint">000023</idno>
<idno type="wicri:Area/Main/Merge">000065</idno>
<idno type="wicri:Area/Main/Curation">000065</idno>
<idno type="wicri:Area/Main/Exploration">000065</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en">SmartDoc-QA: A Dataset for Quality Assessment of Smartphone Captured Document Images - Single and Multiple Distortions</title>
<author>
<name sortKey="Nayef, Nibal" sort="Nayef, Nibal" uniqKey="Nayef N" first="Nibal" last="Nayef">Nibal Nayef</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-40831" status="VALID">
<orgName>Laboratoire Informatique, Image et Interaction</orgName>
<orgName type="acronym">L3I</orgName>
<desc>
<address>
<addrLine>Bâtiment Pascal Avenue Michel Crépeau F-17042 La Rochelle Cedex 1</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-lr.fr/l3i</ref>
</desc>
<listRelation>
<relation name="EA2118" active="#struct-300311" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle name="EA2118" active="#struct-300311" type="direct">
<org type="institution" xml:id="struct-300311" status="VALID">
<orgName>Université de La Rochelle</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
<placeName>
<settlement type="city">La Rochelle</settlement>
<region type="region" nuts="2">Poitou-Charentes</region>
</placeName>
<orgName type="university">Université de La Rochelle</orgName>
</affiliation>
</author>
<author>
<name sortKey="Luqman, Muhammad Muzzamil" sort="Luqman, Muhammad Muzzamil" uniqKey="Luqman M" first="Muhammad Muzzamil" last="Luqman">Muhammad Muzzamil Luqman</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-40831" status="VALID">
<orgName>Laboratoire Informatique, Image et Interaction</orgName>
<orgName type="acronym">L3I</orgName>
<desc>
<address>
<addrLine>Bâtiment Pascal Avenue Michel Crépeau F-17042 La Rochelle Cedex 1</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-lr.fr/l3i</ref>
</desc>
<listRelation>
<relation name="EA2118" active="#struct-300311" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle name="EA2118" active="#struct-300311" type="direct">
<org type="institution" xml:id="struct-300311" status="VALID">
<orgName>Université de La Rochelle</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
<placeName>
<settlement type="city">La Rochelle</settlement>
<region type="region" nuts="2">Poitou-Charentes</region>
</placeName>
<orgName type="university">Université de La Rochelle</orgName>
</affiliation>
</author>
<author>
<name sortKey="Prum, Sophea" sort="Prum, Sophea" uniqKey="Prum S" first="Sophea" last="Prum">Sophea Prum</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-40831" status="VALID">
<orgName>Laboratoire Informatique, Image et Interaction</orgName>
<orgName type="acronym">L3I</orgName>
<desc>
<address>
<addrLine>Bâtiment Pascal Avenue Michel Crépeau F-17042 La Rochelle Cedex 1</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-lr.fr/l3i</ref>
</desc>
<listRelation>
<relation name="EA2118" active="#struct-300311" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle name="EA2118" active="#struct-300311" type="direct">
<org type="institution" xml:id="struct-300311" status="VALID">
<orgName>Université de La Rochelle</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
<placeName>
<settlement type="city">La Rochelle</settlement>
<region type="region" nuts="2">Poitou-Charentes</region>
</placeName>
<orgName type="university">Université de La Rochelle</orgName>
</affiliation>
</author>
<author>
<name sortKey="Eskenazi, Sebastien" sort="Eskenazi, Sebastien" uniqKey="Eskenazi S" first="Sébastien" last="Eskenazi">Sébastien Eskenazi</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-40831" status="VALID">
<orgName>Laboratoire Informatique, Image et Interaction</orgName>
<orgName type="acronym">L3I</orgName>
<desc>
<address>
<addrLine>Bâtiment Pascal Avenue Michel Crépeau F-17042 La Rochelle Cedex 1</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-lr.fr/l3i</ref>
</desc>
<listRelation>
<relation name="EA2118" active="#struct-300311" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle name="EA2118" active="#struct-300311" type="direct">
<org type="institution" xml:id="struct-300311" status="VALID">
<orgName>Université de La Rochelle</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
<placeName>
<settlement type="city">La Rochelle</settlement>
<region type="region" nuts="2">Poitou-Charentes</region>
</placeName>
<orgName type="university">Université de La Rochelle</orgName>
</affiliation>
</author>
<author>
<name sortKey="Chazalon, Joseph" sort="Chazalon, Joseph" uniqKey="Chazalon J" first="Joseph" last="Chazalon">Joseph Chazalon</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-40831" status="VALID">
<orgName>Laboratoire Informatique, Image et Interaction</orgName>
<orgName type="acronym">L3I</orgName>
<desc>
<address>
<addrLine>Bâtiment Pascal Avenue Michel Crépeau F-17042 La Rochelle Cedex 1</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-lr.fr/l3i</ref>
</desc>
<listRelation>
<relation name="EA2118" active="#struct-300311" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle name="EA2118" active="#struct-300311" type="direct">
<org type="institution" xml:id="struct-300311" status="VALID">
<orgName>Université de La Rochelle</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
<placeName>
<settlement type="city">La Rochelle</settlement>
<region type="region" nuts="2">Poitou-Charentes</region>
</placeName>
<orgName type="university">Université de La Rochelle</orgName>
</affiliation>
</author>
<author>
<name sortKey="Ogier, Jean Marc" sort="Ogier, Jean Marc" uniqKey="Ogier J" first="Jean-Marc" last="Ogier">Jean-Marc Ogier</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-40831" status="VALID">
<orgName>Laboratoire Informatique, Image et Interaction</orgName>
<orgName type="acronym">L3I</orgName>
<desc>
<address>
<addrLine>Bâtiment Pascal Avenue Michel Crépeau F-17042 La Rochelle Cedex 1</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-lr.fr/l3i</ref>
</desc>
<listRelation>
<relation name="EA2118" active="#struct-300311" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle name="EA2118" active="#struct-300311" type="direct">
<org type="institution" xml:id="struct-300311" status="VALID">
<orgName>Université de La Rochelle</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
<placeName>
<settlement type="city">La Rochelle</settlement>
<region type="region" nuts="2">Poitou-Charentes</region>
</placeName>
<orgName type="university">Université de La Rochelle</orgName>
</affiliation>
</author>
</analytic>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass></textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Smartphones are enabling new ways of capture, hence arises the need for seamless and reliable acquisition and digitization of documents. The quality assessment step is an important part of both the acquisition and the digitization processes. Assessing document quality could aid users during the capture process or help improve image enhancement methods after a document has been captured. Current state-of-the-art works lack databases in the field of document image quality assessment. In order to provide a baseline benchmark for quality assessment methods for mobile captured documents, we present in this paper a dataset for quality assessment that contains both singly- and multiply-distorted document images. The proposed dataset could be used for benchmarking quality assessment methods by the objective measure of OCR accuracy, and could be also used to benchmark quality enhancement methods. There are three types of documents in the dataset: modern documents, old administrative letters and receipts. The document images of the dataset are captured under varying capture conditions (light, different types of blur and perspective angles). This causes geometric and photometric distortions that hinder the OCR process. The ground truth of the dataset images consists of the text transcriptions of the documents, the OCR results of the captured documents and the values of the different capture parameters used for each image. We also present how the dataset could be used for evaluation in the field of no-reference quality assessment. The dataset is freely and publicly available for use by the research community at http://navidomass.univ-lr.fr/SmartDoc-QA.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>France</li>
</country>
<region>
<li>Poitou-Charentes</li>
</region>
<settlement>
<li>La Rochelle</li>
</settlement>
<orgName>
<li>Université de La Rochelle</li>
</orgName>
</list>
<tree>
<country name="France">
<region name="Poitou-Charentes">
<name sortKey="Nayef, Nibal" sort="Nayef, Nibal" uniqKey="Nayef N" first="Nibal" last="Nayef">Nibal Nayef</name>
</region>
<name sortKey="Chazalon, Joseph" sort="Chazalon, Joseph" uniqKey="Chazalon J" first="Joseph" last="Chazalon">Joseph Chazalon</name>
<name sortKey="Eskenazi, Sebastien" sort="Eskenazi, Sebastien" uniqKey="Eskenazi S" first="Sébastien" last="Eskenazi">Sébastien Eskenazi</name>
<name sortKey="Luqman, Muhammad Muzzamil" sort="Luqman, Muhammad Muzzamil" uniqKey="Luqman M" first="Muhammad Muzzamil" last="Luqman">Muhammad Muzzamil Luqman</name>
<name sortKey="Ogier, Jean Marc" sort="Ogier, Jean Marc" uniqKey="Ogier J" first="Jean-Marc" last="Ogier">Jean-Marc Ogier</name>
<name sortKey="Prum, Sophea" sort="Prum, Sophea" uniqKey="Prum S" first="Sophea" last="Prum">Sophea Prum</name>
</country>
</tree>
</affiliations>
</record>

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000065 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000065 | SxmlIndent | more
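
Once a record has been printed by one of the commands above, it could be post-processed outside Dilib. The following is a minimal sketch, assuming the record text has been saved to a string without an XML declaration. Because the record uses the hal: and wicri: prefixes without declaring them, the sketch wraps it in a hypothetical wrapper element that binds them to placeholder URIs; the wrapper, the URIs and the function name author_names are illustrative assumptions, not part of Dilib or of the record format.

```python
import xml.etree.ElementTree as ET

# Placeholder namespace bindings (assumed): they exist only so that the
# parser accepts the undeclared hal: and wicri: prefixes used in the record.
WRAPPER_OPEN = ('<wrapper xmlns:hal="urn:example:hal" '
                'xmlns:wicri="urn:example:wicri">')
WRAPPER_CLOSE = '</wrapper>'


def author_names(record_xml):
    """Return the distinct text values of <name> elements, in document order."""
    root = ET.fromstring(WRAPPER_OPEN + record_xml + WRAPPER_CLOSE)
    names = []
    for name in root.iter("name"):
        text = (name.text or "").strip()
        # The record repeats each author in several places; keep one copy.
        if text and text not in names:
            names.append(text)
    return names
```

Applied to the record shown on this page, a function like this would be expected to list each of the six authors once, since the repeated name elements carry the same text.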

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     Hal:hal-01319900
   |texte=   SmartDoc-QA: A Dataset for Quality Assessment of Smartphone Captured Document Images - Single and Multiple Distortions
}}

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024